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Ajtai-Szemeredi Theorems over quasirandom 
groups 


Tim Austin 


Abstract Two versions of the Ajtai-Szemeredi Theorem are considered in the Carte¬ 
sian square of a finite non-Abelian group G. In case G is sufficiently quasirandom, 
we obtain strong forms of both versions; if i? C G x G is fairly dense, then E con¬ 
tains a large number of the desired patterns for most individual choices of ‘common 
difference’. For one of the versions, we also show that this set of good common 
differences is syndetic. 


1 Introduction 

A classical result of Ajtai and Szemeredi [1] asserts the following. For every ^ > 0 
there is an < oo such that, if G is a finite cyclic group with |G| > Nq, and 
E C G X G satisfies 


\E\>S\G\\ (1) 

then there are x,y G G and r G G \ {0} such that 

ED{{x,y),ix + r,y),{x,y + r)}. (2) 

A set satisfying (1) is called i5-dense in G x G. Subsets of G x G of the kind on the 
right-hand side of (2) are called Abelian corners. 

Several proofs of this are now known, and it has been generalized to subsets 
of G X G for arbitrary finite Abelian groups G: see, for instance, the discussions 
around [21, Propositions 10.47 and 11.28]. 
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All known proofs actually give a stronger conclusion: that there is some c > 0 
depending only on 6 with the property that any (5-dense set i? C G x G must satisfy 

\{{x,y,r) € : {{x,y), {x + r,y), {x,y + r)} C E}\ > c\G\^. (3) 

This paper considers two candidate generalizations of these results to non- 
Abelian groups G. The most obvious non-Abelian analogs of Abelian corners are 
triples of the form 


{{x,y),{gx,y),{x,gy)}, g ^ e, (4) 

where e is the identity of G. We refer to these as naive corners. A second possibility 
is triples of the form 


{{x,y),{gx,y),{gx,gy)}, g ^ e. (5) 

Triples of this second kind first appeared in [5] (in the related setting of infinite 
amenable groups and their probability-preserving actions), so we refer to them 
as Bergelson-McCutcheon-Zhang (‘BMZ’) corners. Solymosi [20] has studied 
BMZ corners in finite groups in; he writes them as 

{{x,y),{xg,y),{x,gy)}, g ^ e, 

but these may be identified with BMZ corners by first writing in terms of y, g and 
x' := xg and then applying the transformation (a, h) i—>■ (a“^, b). 

When G is Abelian, naive corners are Abelian corners, and BMZ corners are 
equivalent to them by a simple change of variables, but this is not true for non- 
Abelian G. As discussed in [5], when searching for BMZ corners inside a i5-dense 
subset of G X G, some methods from the Abelian setting have useful generalizations 
which do not seem to apply to naive corners. 

Either of these patterns, and indeed many others, must appear inside a i5-dense 
subset of G X G for any sufficiently large finite group G. This is simply because 
such a G must contain a fairly large Abelian subgroup [11], to whose cosets one can 
apply the known results for Abelian groups. 

Our first theorem states this result formally. Here and for the rest of the paper, we 
also allow G to be a non-finite compact metrizable group, since our methods handle 
these with little change. We generally refer to these simply as ‘compact’ groups, 
suppressing the assumption that they are metrizable. If G is a compact group then 
its Haar probability measure is denoted by me', if < G is a closed subgroup and 
g G G, then rUgH and niHg denote respectively the left- and right-translates of mH, 
regarded as a measure on G, by the element g. 

Theorem A. For any b > 0 and any k G N there is an Nq < oo for which the 
following holds. IfG is a compact group with |G| > Nq (in particular, if G is not 
finite), and E C G^ is Borel and satisfies rriQk (E) > S, then there are 

(xi, ... ,Xk) G G^ and gGG\{e} 


Ajtai-Szemeredi Theorems over quasirandom groups 


3 


such that 

ED : (?7i,...,?7fc) G {0,1}'=}, 

where for each i < kwe set x^ := Xi and x\ := gxi. 

The analogous result in which we set x\ := Xig follows immediately, simply by 
applying Theorem A to the set 

e e}. 

The simple proof of Theorem A will be given in Section 2. However, the proof 
does not give a generalization of the ‘counting’ result of inequality (3) to non- 
Abelian G, because it counts only those patterns for which the parameter g lies 
in some Abelian subgroup of G, which may be very small compared to G itself. 

It seems important to begin this paper with Theorem A, but our main results are 
of a different kind. These assert that, if G is sufficiently quasirandom (as defined 
in [13]) in terms of 5, and ED G x G is i5-dense, then there is a large set of possible 
‘common differences’ g such that E must contain both many naive corners and 
many BMZ corners with that specific choice of g. In contrast to the Abelian setting, 
the assumption of quasirandomness allows one to obtain reasonable and explicit 
bounds. The current best bounds in the Abelian case are those of Shkredov [18, 19]. 

Theorem B. Let e > 0 and suppose that G is a D-quasirandom compact group. 
Let E D G X G be Borel and set 5 := mcxciE). Then the set 

A:={pGG: mGxG{(.x,y): {{x,y),[gx,y),{x,gy)} Q E] > 5^ - e} 

has mciA) > 1 — je. In particular, if D ^ A is most of 

G. 


This is proved in Section 4. As far as I know, this is the first result for naive 
corners in non-Abelian groups, although the proof uses only some simple estimates 
and classical representation theory. 

Similar methods give an analogous result for BMZ corners, although the bounds 
we obtain are more complicated, and it seems unlikely that they are optimal. In 
formulating the results for BMZ corners, we no longer take care over the exact 
values of constants: universal constants are simply denoted by 0(1). 

Theorem C. For any e G (0,1/2) there is a 

D = exp ((l/e)°(')) (6) 

for which the following holds. Let G be a D-quasirandom compact group, let E C 
G X G be Borel, and set 5 := mcxciE). Then the set 

B -.= {g€G -. mGxG{{x,y)- {{x,y),{gx,y),{gx,gy)} D E) > - e] 


has mG{B) > 1 — e. 
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In the setting of Theorem C, a closely related argument proves another fact about 
the set B of ‘good common differences’. 

Theorem D. For any s £ (0,1 /2) there is a 

i:) = exp(exp((l/e)0(i))) (7) 

for which the following holds. If G is a D-quasirandom compact group and E C 
G X G is Borel, then the set B defined in Theorem C is K-left-syndetic for some 

it: = exp((l/e)0(i)). 


This gives a different sense in which the set B is ‘large’. We henceforth abbrevi¬ 
ate ‘/T-left-syndetic’ to ‘/T-syndetic’. 

Qualitative versions of Theorems C and D have recently been proved by Bergel- 
son, Robertson and Zorin-Kranich [6]. Their work uses ergodic theory for actions 
of certain ultraproducts of sequences of finite groups, which are of course highly 
infinite; it gives no explicit control ot\ D ox K in terms of e. Our proof below uses 
only elementary inequalities and representation theory, so should be simpler for the 
reader not versed in non-standard analysis. In this respect, the present paper relates 
to [6] as did the earlier work [3] to [7]. We prove Theorems C and D in Section 5. 

Theorems B, C and D together raise the following. 

Question 1. Is it true that, if G is sufficiently quasirandom in terms of e G (0,1/2), 
then the set A from Theorem B is iT-syndetic for some K depending only on el <1 

Intuitively, quasirandom groups are very far from Abelian groups. Taken to¬ 
gether, the Ajtai-Szemeredi Theorem for Abelian groups and our Theorems B and 
C make it natural to ask whether a lower bound of the kind in (3) holds for either 
naive or BMZ corners and for all finite groups G that are sufficiently large. 

In fact, this is true for BMZ corners, since the original graph-theoretic proof of 
Ajtai and Szemeredi generalizes fairly easily to those; see [20]. This observation has 
a counterpart in ergodic theory, where related questions about multiple recurrence 
have been studied [5, 4, 10]. This is why the pattern (5) was first introduced into 
ergodic theory by Bergelson, McCutcheon and Zhang in [5]. 

However, I do not see a way to apply that graph-theoretic argument in looking 
for naive corners, so for those the following question may be of interest. 

Question 2. Is it true that for every (5 > 0 there is a c > 0 such that, whenever G is 
a compact group and E Q G x G satisfies mQ 2 (E) > 6, one also has 


'mG^{{x,y,g) G G^ : {{x,y), {gx,y), {x,gy)} E} > cl 


< 
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2 Proof of Theorem A 


Proof (Theorem A). Step 1: finite groups. The theorem is well-known among 
finite Abelian groups; see, for instance, [21, Proposition 11.28]. Fix S > 0 and 
k G N, and let n be so large that, if A is any finite Abelian group of order at least 
n and F C satisfies rri/^k {F) > S, then F must contain a pattern of the kind in 
Theorem A. 

By a classical result of Erdos and Straus [11], we may now choose N so large 
that any finite group G of order at least N contains an Abelian subgroup H of order 
at least n (see also [15] for an essentially optimal estimate of N in terms of n). 
Suppose that E C has rriQk {E) > 5. Then 

mGk{E) = j ■■■ j TTiHg^x-xHgkiE) mH\G{d{Hgk)) ■ ■ ■ mH\G{(i{Hgi)), 

so there are some cosets Hgi,Hgk for which 


mHgixHg^X-xHgkiE) > 6. 


Define F C by 


En {Hgi X Hg 2 x • • • x Hgt) = E ■ (pi,... ,pfe). 
Applying the Abelian case of the theorem to F gives some 
{xi,... ,Xk) G and hGH\{e} 


such that 

F A : (pi,..., pfc) € {0,1}'=}, 

with the obvious analog of the notation in the statement of Theorem A. Translating 
back to E, this gives 

E A {{{xigiY^,... ,{xkgky'‘) : (rji,... ,r]k) £ { 0 , 1 }'=}. 

Step 2: other compact Lie groups. If G is a non-finite compact Lie group, then 
it contains a nontrivial toral subgroup. This, in turn, contains finite cyclic subgroups 
of arbitrarily large cardinality. Letting H such a subgroup of cardinality at least n, 
we may complete the proof as in Step 1. 

Step 3: general compact groups. Linally, let G be an arbitrary compact group 
which is not finite or a Lie group. As a standard consequence of the Peter-Weyl 
Theorem [8, Section III.3], there are a continuous surjective homomorphism tt : 
G —^ G to a compact Lie group such that the set 

E := {(iri,... ,xfc) £ g" : > 1 - 2-'=-i} (8) 
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satisfies 

mQk{EA(Tr x • • • x Tr)~^{E)) < (5/2, 
and hence m-^k (E) > 5/2. 

Since G is not finite or a Lie group, we may choose G to have cardinality as large 
as we please (allowing infinity if necessary). Having done so, either Step 1 or Step 
2 gives some 

__ 

{xi,... ,Xk) G G and gGG\{e} 


such that 

E2{ixr,...,xl^)-. 

with the obvious analog of the notation in the statement of Theorem A. 

Finally, consider lifts xi,.Xk and g chosen independently at random from the 
Haar measures ■■■, rn- 7 r-i{xfc} Observe that each gxi is 

then a random lift of with distribution Define each x/' using these 

lifts in the usual way. Then it follows from the definition (8) that each of the events 


{xW...,xf) G E for(pi,...,pfc) G {0,1}'= 


has probability at least 1 — 2“'=“^. Therefore, by the first-moment bound, there is 
some choice of xi,Xk and g for which all of these events occur simultaneously. 
This completes the proof. □ 


3 Preliminary discussion of the results for quasirandom groups 

The proofs of Theorems B, C and D have several common elements. This section 
introduces some of those. 

It will be convenient to use some simple notation and terminology from er- 
godic theory. Given a compact group G, a probability (G x G)-space is a tuple 
(Y, 27, 1 /, S, T) in which {Y, 27, v) is a probability space and S and T are two com¬ 
muting, j^-preserving G-actions on that space. Since they commute, S and T to¬ 
gether define an action of G x G, hence the name. Given such a probability (G x G)- 
space, we will write 27"®, 27^ and for the cr-subalgebras of sets in 27 that are 
invariant under S, T, or the whole (G x G)-action, respectively. 

For example, let 2^ := G x G with the measure fi = mx, let 27x be the Borel 
(7-algebra of X, and define 

S%x,y) := {gx,y) and T^{x,y) := {x,gy). 

This turns {X, Ex, y, S, T) into a probability (G x G)-space. It will appear repeat¬ 
edly below. 

Now consider measurable functions fi, f 2, f 3 ■ X — > [—1,1], and suppose G 
is highly quasirandom. For Theorems B, C and D we need to estimate the values 
taken by 
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either 


J fi ■ ■ hTS or y A • hSa . 


for ‘typical’ group elements g. We ultimately need to do this in case /i = A = /a = 
1^;, but allowing more flexibility will be important for the proofs. The strategy for 
these estimates takes a form that has become well-known in additive combinatorics: 
each of the functions A will be decomposed as 



in such a way that the functions contribute very little to the integrals of inter¬ 
est for ‘most’ group elements g, while the functions f° have some extra ‘structure’ 
which makes the estimate of that integral easier. In such a decomposition, the neg¬ 
ligible terms are often called ‘quasirandom’. Here, ‘most’ group elements will 
mean those lying in a very large subset of G in the case of Theorem B or C, or those 
lying in a suitable syndetic subset in the case of Theorem D. A nice discussion of 
this methodology can be found in [14]. 

Different decompositions are required for studying naive and BMZ corners. For 
the former, the required decompositions are fairly simple, and will be introduced 
within the proof of Theorem B. For BMZ corners we need a rather more complicated 
construction, based on the regularity lemma of Frieze and Kannan [12]; this will be 
explained separately in Subsection 5.1. 


3.1 Estimates from probability 

The following simple lemma plays the role of the classical van der Corput estimate 
in the present paper. It will be the basis for several other estimates later. 

Lemma 1. Let (Y, S, v) be a probability space, let V be a real or complex Hilbert 
space with inner product (•, •) and corresponding norm || • ||, and let y ^ Uy be a 
strongly measurable function Y —> V. Let v be a unit vector in V. Then 



Proof. Define 

ip{y) := {v,Uy)l\{v,Uy)l 

using the convention that p{y) = 1 if {v,Uy) = 0. So (p takes values in the unit 
circle, and more specifically in {—1,1} if V is real. This is a measurable function 


of y, and 
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\{v,Uy)\v{dy) = J ip{y){v,Uy)v{dy) = (v, j y}{y)uyv{dy) 

J V^{y)uy i^{dy) = ^ jj{ip{y)uy,ip{y')uy')v{dy)v{dy‘ 


< 




We will also need the following very general probabilistic estimate. It can be 
found as [9, Lemma 1.6], and then [6] cites it for a similar purpose to ours. 

Lemma 2. Let (Y, S, v) be a probability space, let Si, ..., S^ be a-subalgebras 
of S, and let f be a bounded non-negative measurable function on Y. Then 

k 

J f-f[E{f\S,)du> ( J 
2=1 


□ 


3.2 Estimates from representation theory 

All of the representation theory in this paper concerns either unitary representations 
on complex Hilbert spaces or orthogonal representations on real Hilbert spaces. 

The results we need from representation theory are all standard, and can be found 
in many textbooks. A good reference for our purposes is [8, Chapters II and III]; that 
book actually focuses on compact Lie groups, but all the facts we cite from it clearly 
hold for arbitrary compact groups. 

If TT : G rv L is a unitary or orthogonal representation, then P'^ denotes the 
orthogonal projection from V onto the subspace of 7r(G)-fixed vectors. 

A unitary (resp. orthogonal) representation is D-quasirandom if it has no ir¬ 
reducible subrepresentations of complex (resp. real) dimension less than D. Fol¬ 
lowing [13], the group G itself is £)-quasirandom if and only if all its non-identity 
irreducible unitary representations are D-quasirandom. Quasirandomness will be 
exploited in this paper by way of the following two lemmas. 

Lemma 3. If G is a D-quasirandom compact group, n : G r\ V is a unitary 
representation, and u,v G V, then 

®v)- P^u ® ^ D-^^^\\u\\v\\v\\v, 

where || ■ ||y denotes the norm on V and || • ||y(g)y denotes the Hilbertian tensor 
product of that norm onV ®V. 
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Proof. This is a routine consequence of Schur’s Lemma: it can be found as [3, 
Lemma 2]. □ 

Lfnlike the above, the next lemma will be needed for both unitary and orthogonal 
representations. 

Lemma A. If G is a D-quasirandom compact group, -k : G r\ V is a unitary or 
orthogonal representation, and u,v € V, then 

J\{u,ttSv)v - {P""u,P^v)v\'^ dg < D"i||u|||.||w||^. 

Proof In the unitary case, this is [3, Corollary 3] (although the proof given there 
contained an error, corrected in [2]). 

Now suppose that tt is an orthogonal representation. Then its complexihcation 
■k'^ ■. G rx V (8)r C is a unitary representation, which may also be regarded as an 
orthogonal representation isomorphic to tt 0 tt (see, for instance, [8, Section II. 6]). 
A simple calculation shows that P'^ is simply the complexihcation of P'^. Hence 
the desired inequality for tt follows from its counterpart for -k'^. □ 


4 Naive corners 

This section proves Theorem B. The key to this proof is the following proposition, 
which roughly asserts that ‘correlations’ across naive corners almost vanish unless 
one starts with functions that have some nontrivial structure. 

Let {X, Xx-, P, S, T) be as in Section 3. 

Proposition 1./i,/ 2 ,/a : X —>• [—1,1], and either E (/2 | = 0 or 

E(/3 I El) = 0, then 


j\j h- hS^ ■ hT^ clp| dg < 7312-1/4. 


Proposition 1 will in turn be deduced from a kind of ‘mixing’ estimate, formu¬ 
lated in the next lemma. Before stating the lemma, it will be helpful to have some 
notation for representations of product groups. In this section the representations 
will all be unitary, even though the functions in Proposition 1 are real-valued, since 
we will need to make another direct appeal to the Schur Orthogonality Relations. 
Given any two unitary representation tt ■. G r\V and 9 : G r\ W, one obtains a 
representation GxGr\V<SiW hy dehning 

{TrMe)<^3,h) (g,gh_ 

It is a standard result that tt K 0 is an irreducible (G x G)-representation if each of tt 
and 9 is irreducible, and that all irreducibles of G x G arise this way [8, Proposition 
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II.4.14]. (On the other hand, as far as I know there is no standard notation for 7rK0.) 
It follows at once that if G is O-quasirandom, then so is G x G. 


Lemma 5. Let G be D-quasirandom, let p : G x G r\ U be a unitary representa¬ 
tion, and let u G U. Then 


j ^udg — P^u 


<D-^u\\. 


Proof. By decomposing p into irreducibles, it suffices to prove this when p is itself a 
nontrivial irreducible. According to the discussion above, this means that p = ttMO 
for some irreducible unitary representations tt ■. G r\V and 9 : G r\ W, not both 
the identity. 

In this case we will prove that 


J "\v (g)w)dg = J {tt^v ® ' w) dp = I ® ^ (9) 


for any v G V and w G W. Since tt and 9 cannot both be the identity, if they 
are equal then dim(I4) > D, by the assumption of quasirandomness. Therefore (9) 
implies that 


n(9.9 


’udg 


< D- 


VuGU. 


To prove (9), it suffices to check the inner products of the two sides of this equa¬ 
tion against another element of U, which can also be of tensor product form. Thus 
the desired equality becomes 


J (yK^v®9^ w,v'®w')dg = J {Tr^v,v'){9^ w,w')dg 


0 else. 


This is now one of the standard Schur Orthogonality Relations: see [8, Theorem 
II.4.5(ii)]. □ 

Corollary 1 . If G is D-quasirandom and Fi,F 2 G L^{G x G), then 

^ dp- [ Fidp [p^dp dg <2 D-^^^\\Fi\\2\\F2\\2. 


Remark 1. The use of F 2 rather than F 2 on the left-hand side here is only for the 
sake of convenience. With this choice, the integral J Fi ■ F 2 T^S^ d/r is the Her- 
mitian inner product in L‘^{G x G), which leads more easily to an application of 
Lemma 5. <1 


Proof By replacing each Fi with Fi — J Fi dp, we may assume that they are both 
orthogonal to the constant functions. In this case we will show that 
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F^-F2TSS0 df, dg<2D-^\\Fi\\qF2r, 


from which the result follows by the Cauchy-Bunyakowksi-Schwartz inequality. 

This follows using a standard tensor-product trick. Let X := X x X, J1 := g, 

S := S X S and T := T x T. Then the integral above is equal to 


F 2 TSSF" dpd 5 = ^ Fi ( y F2TSSS~" dp) djl, 


where Fi{x, y, x', y') := Fi{x, y)Fi{x', y'). Applying LemmaS to the inner integral 
on the right here, this is at most 


I E(r-, I E^/)E{F, I r|*‘)dp + D-‘||r-illi.,5|||F2lliJ(f) 

< l|E(fd v|LL=,ii,||E(A I I|f2f. 

Finally, in view of the product form of Fi and the fact that G x G is still D- 
quasirandom, an appeal to Lemma 3 gives 

for* = l, 2 . 

Substituting this into the bound above completes the proof. 

Proof (Proposition 1). We give the proof in case E (/2 | S^) = 0, the other case 
being analogous. 

For each g € G, let 

ug := f2S^ ■ hTF 
By Lemma 1, it suffices to prove that 

JJ \{ug,Ug>)\dgdg' = jj \ {ug,Uhg)\dg dh < . 


The second integrand here may be re-written as 


\{Ug^ Uhg)\ 


J f2Ss ■ f2S>^s ■ fsTS . 

J f2 ■ hS^ ■ dy 

J F2,h-F3,hTSSS~" dy\, 


where := /2 • f 2 S^ and F 3 ,/, := /a • fsT^. 

Both F 2 .h and Fa./t are real-valued and bounded by 1 in absolute value, so Corol¬ 
lary 1 applies to give 
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I |K,w;,,)|d5< I J f2-f2S'^dfi\\ I f3-hT^df,\+2D-^/^ 

for any h. Now integrating over h, and using again that I /3 • /sT^j < 1, the right- 
hand side above turns into the bound 

J I J f2-f2S'^dfi\dh + 2D-^/^< (^J (^J h- f2S^ d^i"f dhy^'\2D-^/\ 


By Lemma 4 and the fact that E (/2 | S^) = 0, this is at most 3D This com¬ 
pletes the proof. □ 


Proof (Theorem B). We will prove that, for any function f ■. G x G —> [0,1], the 
set 


A := e G : JJf{x,y)f{gx,y)f{x,gy)dxdy>(^Jfdy - e| 

has measure at least 1 — 2v/3D“^/^/e. Applying this to / = gives Theorem B. 
Define new [—1,1]-valued functions and fjj- by the decompositions 

/ = E(/|r|) + /2^ = E(/|D5) + /3L. 

In the present setting, these are the decompositions of /2 = / and fs = f into 
‘structured’ and ‘quasirandom’ parts, as promised at the beginning of Section 3. It 
turns out that no related decomposition f° + is needed here. 

Substituting the first decomposition into the middle position of the relevant inte¬ 
gral, and then the second decomposition into the last position, we obtain 


JJ f{x,y)f{gx,y)f{x,gy)dx dy 

= 11 f{x,y)£{f\Sl^){gx,y)£{f\S'^){x,gy)dxdy 

+ ///(-, 

+ II f{x,y)m\Sx)i9x,y)f3{x,gy)dxdy. (10) 

Since E(/ | S^) is S'-invariant and E(/ | S]^) is T-invariant, the first integral on 
the right-hand side of ( 10 ) is equal to 

Jj fix, y)E(/ I S^)ix, y)E{f \ Sx)ix, y) dx dy 


for any g, and this is bounded below by (J / d/i)^ by Lemma 2. Re-arranging (10), 
it follows that 
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A2 A'■= € G : JJ f{x,y)f 2 {gx,y)f{x,gy)dxdy\ 

+ JJ f{x,y)^f\S^){gx,y)f^{x,gy)dxdy < ej. 


On the other hand, since E(/^ | S^) = E(/^ | = 0, two appeals to Propo¬ 

sition 1 give 




dff 


JJ f{x,y)E{f\S^){gx,y)fi-{x,gy)dxdy dg < 2V3D 


Hence mc^A') > 1 — 2\/^D je, by Chebyshev’s Inequality, so the proof is 
complete. □ 


5 BMZ corners 

This section proves Theorems C and D. The two proofs have much in common. We 
will explain the overarching structure of both proofs first, and then separate their 
finer details. 

In this section it now makes more sense to work with orthogonal (real) represen¬ 
tations than with unitary ones, since we will not need any representation theoretic 
results beyond Lemma 4. 

Again let {X, Sx,y, S, T) be as in Section 3. 


5.1 Decomposition into structured and quasirandom functions 

We now describe the decompositions into ‘structured’ and ‘quasirandom’ functions 
appropriate to the analysis of the family of triple forms 

J fi ■ f 2 S<^ ■ fsSSTS dy, gGG. (11) 

The functions /i, /2 and /a play different roles here, so each will need its own notion 
of ‘quasirandom’ and ‘structured’ summands, even though our ultimate interest is 
in the case /i = /2 = /a = 1^. 

As is common in this area, the appropriate notions of ‘quasirandomness’ are 
measured as smallness in certain norms. We introduce these next. 

For a bounded function / ; X —R, define 
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ll/ll®i,2 •= s^p{ J fi^^y)9{x)h{y)dfi: blU, ll^illoo < l}- 

This is a classical construction in Banach space theory: the injective tensor norm 
II ■ \\L'^(mG)iSL'^{mG)- Originates in Schatten’s work on tensor products of Banach 
spaces, where it is referred to as the ‘bound cross-norm’. See [17, Section II.3], 
or [16, Chapter 3] for a more modern treatment. It has now become popular in 
extremal combinatorics, where it is called the ‘box norm’. 

Similarly, define 


Il/ll®i,i2 ■= sup{ J f{x,y)g{y)h{x ^y)dg: UpHoo, ||/i||oo < l} 

and 

Il/ll®i2,2 •= J fix^y)9{x~^y)h{x)dfi: HpIU, ||/i||oo < l}- 

These may also be viewed as injective tensor norms, once one chooses appropriate 
‘coordinate axes’ on G x G. 

For general functions /i, /2 and /a on G x G, we will make use of Frieze and 
Kannan’s weak version of the Szemeredi Regularity Lemma [12] (see also [14, 
Proposition 2.11] for a formulation closer to the present paper). The next propo¬ 
sition gives the specific instance of this result that we need. Recall that if P is a 
partition of a set S and s G S, then V{s) denotes the cell of V which contains s. 

Proposition 2 (Weak Regularity Lemma). Given a measurable function f : X —s- 
[—1,1], and also rj > 0, there are partitions Vip and Vip 2 of G, each into at most 
exp((l/? 7 )®(^^) cells, for which the following holds. 

Define a new partition of G x G cell-wise by setting 

Qi{x, y) := Vipiy) n 'Pip 2 {x~^y), (12) 

and let Ei : L°°(p,) —> L°°{p) be the operator of conditional expectation onto 
Qi- Then 

II/- ^ 9- 

□ 

This is not the formulation of the Frieze-Kannan Regularity Lemma given in [ 12] 
or [14], but it is an easy consequence. The methods of [12] or [14] give instead a 
function 

mo 

f°{x,y) = \mh'^{y)h'f^{x~^y) 

m—l 

which approximates / in the sense that 

Il/-/“ll®i,i2 < 7, 


and where 
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• mo = 

• each Am £ [— 1 , 1 ], 

• and each and is an indicator function on G. 

For our purposes it is important to approximate / by a function that still takes values 
in [ 0 , 1 ], hence our preference for approximating by a conditional expectation of 
/ itself. To obtain suitable partitions Vi^i and V 1.12 from the function f° above, 
one simply lets Vi^i be generated by the level sets of all the functions h'^, m = 
1 , 2 ,..., mo, and similarly for P 142 using h'^. 

In recent years, techniques for decomposing a function into structured and quasir¬ 
andom parts have become quite sophisticated: see [14], for example. A more careful 
argument than ours might enable one to be more efficient in the above proposition, 
and perhaps ultimately improve the bound in Theorem Cto D = (l/e)°*^^^. How¬ 
ever, it is not the purpose of the present paper to explore this kind of enhancement, 
and we content ourselves with the partitions obtained above. 

We will also need two variants of Proposition 2 that are obtained from it by sim¬ 
ple changes of variables in G x G. Given / and 77 as above, there are also partitions 

7 ^ 2 . 1 , 'P 2 . 2 , 'P 3,2 and P 3 ,i 2 , all into at most exp((l/? 7 )°*^^)) cells, for which the fol¬ 

lowing hold: defining two new partitions of G x G by 

Q 2 {x, y) := V 2 , 2 {x) n V 2 .i{y) (13) 

and Q3{x,y) :='P3,i2{x~~^y)r\P3^2ix), (14) 

and letting : L°°{y) — L°°{y) be the operator of conditional expectation onto 
Qi for * = 2, 3, one has 

II/- ^ and II/- < 77 . 

At some points below, we will need to write out the functions E^/, i = 1,2,3 
in terms of more elementary summands. Using the individual cells of Qi, one may 
always express 



Ml 

(15) 


mi — 1 

M 2 

{^2f){x,y) = ^ h2,l,mAy)f^2,2,mAx) 

(16) 

and 

m2 — l 

M3 

{^3f){x,y) = ^ h3^2,m3ix)h3^i2,m3{x~^y), 

(17) 

where Mi = \Qi\ 

m3 = l 

for 7 = 1,2, 3, and each h, , , is a measurable function G 

_ 


[- 1 , 1 ]- 
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5.2 Estimates for structured functions 

This subsection analyzes the triple form (11) when each fi is replaced by a struc¬ 
tured approximant Eif as given by Proposition 2. For these approximants, we can 
exert very precise control over that triple form: it turns out that it hardly depends on 
g at all. This fact will result from the following. 

Lemma 6. Let hi,h 2 , (112 : G —y [—1,1] be measurable, and suppose that G is 
D-quasirandom. Then 



{x)h 2 {y)hi 2 {x ^y)dxdy 





<d-g\ 


Proof. This is essentially the implication (v) => (iv) in [13, Theorem 4.5]. Let R 
be the left-action of G on itself. The left-hand side above may be re-written as 


J hi{x)(^ J h 2 {y)hi 2 {x ^y)dy- j ^ 12 )) dx 

= J hi{x)(^{h2,hi2R^ “ (y ^ 2 ) (ydx . 

By the Cauchy-Bunyakowski-Schwartz inequality, this is at most 

y {{h2,hi2R^ )l‘^{g) ~ (y(y ’ 

and Lemma 4 bounds this by □ 

Corollary 2. Suppose that G is D-quasirandom, and that 

hi.x,y) = hi^i{y)hi^i2{x~^y), f2{x,y) = h2,iiy)h2,2{x) 

and fsix^y) = h3^2ix)h3^i2{x~^y) 

for some measurable functions /i,., : G —> [—1,1]. Then the quantity 

Pig) ■= J fi ■ f2S<^ ■ fsS^TS dp 


satisfies 


\pig)-pig')\<2D-^/^ Vg,fGG. 

Proof For this choice of functions f, one has 

pig) = JJihi,ih2,i)iy) ■ ihi ,12/13,i2)(a: ^y) ■ (/i2,2/13,2)(5a;) dxdy. 


Applying Lemma 6 to this integral, we find that it lies within D of 
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^ 1 , 1^2 


^ 1 , 12 ^ 


3,12 



2,2^3,2 


) 


for all 5. □ 

Proposition 3. Lef e > 0, and let Qi, i = 1,2,3, be partitions as in (12), (13) 
and (14). Assume that G is D-quasirandomfor some 

Let the sets E and B be as in Theorem C. Then B contains the set 
C-.= [g&G-. p{ECl {g-\e) ■ E n {g-\g-^) ■ E) 

> J (Eilis) • (E2l£;)^® • (E3li5)5ST9 dp - e/2}. 

Proof. According to the decompositions (15)-(17), the quantity 

:= y (EiIb) • (E2l£)5® • (E3l£)^®T9 dp 

is a sum of at most IQ1IIQ2IIQ3I quantities having the form p treated by Corol¬ 
lary 2. Therefore, by that corollary, 'f{g) varies by at most 

2 \Qi\\Q 2 \\Q 3 \D-^'^ <e/2 


as g varies in G. Therefore 

p{E<A{g~^,e) ■ E fA {g~'^, g~^) ■ E) > ipi^e) -e = J ■ E2IB ■ EslEdp-e 
for all g € G. 

Finally, an appeal to Lemma 2 gives 


Ei 1_E ■ E2l_E ■ E3l£; dp > 



■ Ei1_e ■ E 2l_E ■ E3IB dp > p{E)'^. 


Hence g € B for all g G C. □ 

In the remainder of the proofs of Theorems C and D, we show that the set G from 
the above corollary is large in the required senses, rather than handle the set B from 
Theorem C itself. 
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5.3 Completed proof of Theorem C 

In order to use the approximants E^/i to the functions fi in estimating the triple 
forms (11), we need to convert a bound on the norms such as IHIgij ^ into some more 
direct control on those triple forms. That control is given by the next proposition. 

Proposition 4. Let fi, f 2 , f 3 ■ X — > [—1; 1] measurable. Then 


J I J fi ■ f2Ss ■ dfi 




< ^21:1-1/4 + ^mm{||/i||^^^^, 11/211^^ 

This may be viewed as the analog of Proposition 1 for BMZ corners. It will be 
deduced from the following intermediate estimates. 

Lemma 7. For any f : X —> [—1,1], the following inequalities hold: 



J II E(/. /s'^ 1 r^)||i dh < 79-1/2 ^ 

(19) 


J II E(/ . /(ST)'^ 1 ^5)||i dh < 79-1/2 ^ 

(20) 

and 

1 II E(/ . /(ST)'^ 1 9;|)||2 dh < 79-1/2 ^ 

(21) 


Proof. We first prove (19). Define a new probability (G x G)-space as follows: let 
Y := G X G X G, let z/ := my, and let the two generating actions be 

S^(x,y,z) = {gx,y,z) and {x,y, z) = {x, gy, gz). 

(In ergodic-theory terms, this is the relative product of two copies of {X, /r, S', T) 
over Then a simple calculation shows that 

\\£{f-fS^\Sl)\\l = j F-FS^du = {F,FS^)^,^^^, 


where 

F(x,y,z) := f{x,y)f{x,z). 

Let H := E(T' | Sy). Applying Lemma 4 to L' and S, it follows that 


J ||E(/./s‘|ri;)||?ci/i = J 
< /l<f. 




dh 

d'l+IT.ffLMi 
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On the other hand, H is a function of only the coordinates y and z for a point 
{x,y,z) G Y, and therefore 




L^{u)\ 


< 


< 


fix, y)fix, z)H (y, z) dx dy dz 
JJ fix,y) ■ ifix,z)H{y,z))dxdy 


dz 


dz = 


The proof of (20) is very similar. One uses the same auxiliary (G x G)-space 
(y, ly, S, T) and function F as before, but now one proceeds from the estimate 

I \\E{f-f{ST)^\S]^)\\ldh = II F-F{Sf)U,ydh 

= I {F,F{Sff)^,^^^ dh<D-F^ + \ ! F-E{F\sf)dv . 

The function iJ' := E(F | Sy^) can be written as a function of only x~^y and x~^z 
for a point {x, y, z) € Y. Using this, the change of variables w := x~^z gives 

^ F- H'diy 


< 


< 


fix,y)fix,z)H'{x ^y,x ^z)dxdydz 
f{x, y)f{x, xw)H'{x~^y, w) dx dy dtu 
II fix,y)-ifix,xw)H'{x~^y,w))dxdy 


dw 


9l2,2^^=\\fUl2.2- 


Finally, inequality (21) is simply (20) with the roles of S and T reversed. □ 

Proof (Proposition4). We first prove the bound that uses either ||/2||0j^ ^ or H/aHigijj ^ 
Let Ug := f 2 S^ ■ By Lemma 1, it suffices to prove that 

II \ {ug,Ug')\dgdg' = II \ {ug,Uhg)\dg dh 

< + Y^minlll/all^^ ll/all®,, J. 


For any g,h G G, one has 

















20 


Tim Austin 


\{Ug,Ung)\ = I J f2S^ ■ f2S^^ ■ hS^TS . f^S^ 9 Th 9 

= J f2 ■ f2S^ ■ hTS . d^l 




where := f 2 -f 2 S^ := /a •/a (S'T)'', both of which take values in [—1,1]. 

Therefore Lemma 4 and the Cauchy-Bunyakowski-Schwartz Inequality give 


II\{ug,UHg)\dgdh< I \{E{Fn\S],),E{F;,\S],))\dh + 

< I ||E(F;,|2;5)||2||E(F'|i:5)||2d/t + i^-i/2 



Each of the integrals inside this last square root is certainly at most 1. Therefore, by 
Lemma 7, the last line above may be bounded by 


either \\f2\\0^_^ + D-^/^ +D < ^||/2||®,_, + 2D 

Of ^11/311®,.,. + < \/ll/3||®,,,, + 277-V4. 

Finally, the proof of the bound using ll/ijj^j^ i® exactly analogous to the case 
of ||/a||(g)j2 2 once one makes the substitution g' := g~^ to write 


y I y /i • f 2 s^ ■ /3^®TSdp| dg= y I y /3 • • hs^Ta d^| d^'. 


□ 

Proof (Theorem C). Let / := 1 _e and g := e®/(4 • 6^). Then there are certainly 
values of D satisfying (6) for which 

^2L»-i/4 + y^ < eV6. (22) 

For this function / and error tolerance g, let the partitions Qi, i = 1,2, 3, and 
corresponding operators E^ be given by Proposition 2 and its two variants. Then 
|Qi| < exp(2(l/77)°(^i) for each i, and so there are values of D satisfying (6) for 
which (18) also holds. Therefore Proposition 3 applies, and so it suffices to show 
that the set C from that proposition has measure at least 1 — e. We will in fact prove 
that 


y I y ./ • • fS^TO dp - y (Ei/) . (E2/)5® • {E^f)S<^T^ dp| d^ < eV2. 
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The desired lower bound on mc{C) follows from this by Chebyshev’s Inequality. 

Let = f — ^if for each i. By the triangle inequality, the above estimate is a 
consequence of the following: 


and 


J I J f-fS^-fi-SaT^df,\dg<ey6, 

j\j f- ■ (E3/)59rsd/r| dg < 

j\j ft- i^ 2 f)S^ ■ (E3/)5®T9 dg\ dg < eV6. 


These are all now implied by Proposition 4, together with (22). 


□ 


5.4 Anti-neighbourhoods and syndeticity 

The proof of Theorem D will be based on having a large supply of fairly syndetic 
subsets of a quasirandom group ready to hand. These subsets will be obtained from 
a simple construction in terms of representations. 

Let TT : G 14 be an orthogonal representation, let u,v G V, and let e > 0. If 
= 0 (that is, TT contains no copy of the identity representation), then let 

A{TT,u,v,e) ■={gGG: \{u,tt3v)\ < e}, 

and call this the (tt, u, v, e)-anti-neighbourhood. For general tt, let 

A{TT,u,v,e) := {g £ G ■. — {P'^u, P'^v)\ < e}. 

If G is P-quasirandom for some large D, and u, v are unit vectors, then the 
corresponding anti-neighbourhoods are quite large: Lemma 4 and Chebyshev’s In¬ 
equality imply that 


D-^ 

mG(A(Tr, u, V, e)) > 1-j-. (23) 

This is very intuitive. If tt is irreducible with large dimension d, then the orbit points 
TT^v should be fairly evenly spread around the high-dimensional unit sphere in V, 
and so most of them will be nearly orthogonal to any fixed direction u. 

The present section shows that, if G is highly quasirandom, then anti-neighbourhoods 
are also fairly syndetic. Moreover, one can intersect a controlled number of anti¬ 
neighbourhoods, and that intersection is still fairly syndetic. This is not implied 
solely by the largeness of those intersections, but it will follow from some sim¬ 
ple inner-product estimates. In the next section, the syndeticity of the set in The¬ 
orem D will be proved by showing that it contains such an intersection of anti¬ 
neighbourhoods. 
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Let us begin with a rough sketch of why a single anti-neighbourhood should 
be fairly syndetic, before carefully proving the result we need about intersections. 
The key point is that, if d is very large and we choose a moderately large number 
of group elements hi, hk independently at random, then with high probability 
the image-points tt^’ u will all be nearly orthogonal to one another. The intuition 
here is the same as that above: each of the random points tt^'u should be fairly 
evenly distributed around the high-dimensional unit sphere of V, independently of 
the others. However, having obtained some /i^’s for which the vectors are all 
nearly orthogonal, this then forces any other unit vector to be nearly orthogonal to 
at least some of them. In particular, for any g G G, there is an i such that tt^'u and 
7r®u are nearly orthogonal. This implies that u is nearly orthogonal to and 

hence that g is in hi ■ u, v, e) for some small e. Crucially, after fixing the right 
choice of hi, ..., hk, this argument works for every g G G. 

Such ‘near orthogonality’ will be deduced using the following. 

Lemma 8. Let V be a real Hilbert space and let v and u^,..., u™ be unit vectors 
in V. Suppose that 


\{u'',u^)\ < Xjvr? whenever i ^ j. 


Then 

< 2 . 

i<m 

Proof. Let at := {v, u*) for each i, so one always has |a.| < 1, and let 

w := OiU*. 

i<m 


The assumed inequalities give 

11^11^= X] aiUjinfu^) + 1, 

i,j<m i<m i<m 

and hence _ 

X 

i<m 

This implies that J2i<m of < 2. □ 

In order to study intersections of anti-neighbourhoods, we will actually need the 
following crude corollary which concerns several Hilbert spaces simultaneously. 

Corollary 3. Let Vi, ., 14 be real Hilbert spaces, and let V£ and uj,, u™ be 
unit vectors in Vefor each L Suppose that 

\{ug,u^^}\ < \/rr? whenever ^ < k, i j. 

Then there is some i < m such that \ {vf,,u\)\ < iJ2k/m for all i < k. 


X' 

i<.m 


= {v, w) < ||u|| ||r(;|| < 
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Proof. Summing the inequalities proved in the preceding lemma gives 

i<m £<.k 


□ 

Corollary 4. Let p > 0 and k > 1, and set K := \2k/rf' + 1] and D := K^k + 1. 
Let -Ki : G r\ V(, for i = 1,2,..., k be orthogonal representations that are all 
D-quasirandom, and let ue,ve € Vibe unit vectors for each L Then 

k 

A := Pi A{-Ki,ui,vi,ri) 

1=1 


is K-syndetic. 

Proof. Let 

k 

A' := P A{-Ki, ui, ui, 1/K^). 

1=1 

Let hi,..., hx be a iL-tuple of elements of G drawn at random from the measure 
. Then the estimate (23) and a first-moment bound give 

gA'Wi^ j in {1,2,..., K}] 

k 

>l-J2Y.^G{G\A{7ri,ui,ui,l/K^)) >l-K^kD-^>0. 
^=1 

This implies that there exists a iL-tuple hi,..., hx in G witnessing the above 
event, hence such that 

\{'K^'ui,'K^^ui)\ < \/K^ whenever^ < k, i f j. 

Therefore, for any g € G, Corollary 3 promises some i < K for which 

\{TT^'ui,TT^Vi)\ = \{u£,Tr'^' ^ve)\ < i/2k/K < rj Vf < fc, 
so g e {hi,... ,hK} ■ A. □ 


5.5 Completed proof of Theorem D 

The next step is the following rather technical proposition. 

Proposition 5. Let e > 0 and n € N. Set k := [4/e^], and now set g := l/(3fc)®. 
Suppose that G is D-quasirandom for some D > APf jg^. 
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For £ = 2, 3, let /^, /| and be three [—1, l]-valued functions, and suppose 
that 

||/|||®i^<?7 and <rj- 

Finally, let 

Q := {g e G : \ j fi ■ d/t| < e} for £ = 2,3. 

Then there are elements hi,... ,hk G G and some auxiliary [—1, f\-valuedfunc¬ 
tions 

Flij and forl<i<j<k 

such that the set 


E:= f| {A{T, Fl^^, Fl,^^, r;) n A{T, FI,^^ , FI,^^ , r,)) 


l<i<j<k 


satisfies 

Ec{hf\...,hf^}-iC2nC3). 

Remark!. Letting m := [2fc^/77^ + l], the set G above is m-syndetic by Corollary 4 
provided D large enough, and so the above conclusion implies that the intersection 
C 2 n G3 is (mfc)-syndetic. However, this fact alone is not quite what we need for 
the proof of Theorem D: ultimately, that will require the syndeticity of the smaller 
intersection Gi fl G2 fl G3 for some other set Gi. In order to prove that, it will be 
important to have an explicit ‘witness’ to the syndeticity of C 2 H G3 in the form 
of an intersection of anti-neighbourhoods, such as E above. This is why the above 
proposition is formulated as it is. <1 

Proof We will prove that if hi, ..., hk are chosen independently at random from 
the Haar measure mg, then with positive probability one obtains a tuple for which 
the remaining objects required by the proposition also exist. 

Step 1. For such a random choice of hi, ..., hk, each difference hjh~^ for 
i f j is also a uniform random element of G, and so Lemma 7 gives the estimates 


mf^-f^S^FT^\E]^)\\ldhi---dhk-idhk<D-F^+p 


and 


//•••/ WHfl ■ \S],)\\ldhi---dhk-idhk < +TJ 

for each i f j. 

Our assumptions imply that and so the right-hand sides above are 

all less than 2p. Therefore, if hi, ..., hk are chosen randomly as described above, 
then, by Chebyshev’s Inequality and a first-moment bound, the event that 
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II E(/| • I Sl)\\l < for all * ^ j 

and \\£{fl-fl{STp^"\El)\\l<^ for alH ^ j (24) 

has probability at least 

1 - = 1 - 2^fcV(3fc)'‘ > 0. 


Therefore there exists a tuple hi, hk € G for which the inequalities in (24) 
all hold simultaneously. 

Step 2. We now define the required auxiliary functions as follows: 




and := 

and := 


whenever l<i< j <k. 


Step 3. Having chosen those auxiliary functions, let E be as as in the statement 
of the proposition. Suppose that g G E. We must show that hig G (72 H Ca for some 
i < k. 

For each g G G and £ = 2,3, define 


ul := fiS<^ ■ 

For any i < j in {1, 2,... ,k} and £ = 2, 3, we have 


\{u 


e 

hig^ 



J . fishj9 . f^S^igrph^g , flghigph^g 
J dp 

J Fl,^j ■ dp . 


Since g G E, in particular g G A(T, F^^ j,E^^ j,g), and so the above is at most 


J E(F2^^.|2;5)E(F3^jr5)dp 

If £ = 2 then this expression is bounded by 


V- 


mFL,j I ^x)ll2 + V = l|E(/|5^^ • I r5)||2 + V 

= IIE(/2. I 2:5)112 + P < ^ + P < 3^ = l/fc5 

by (24). Similarly, if £ = 3 then it is bounded by 

l|E(i"3«-1 ^x)ll2 +V= ||E(/| • I 2:5)112 + V < l/k\ 
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Thus, we have shown that 

whenever i £ = 2,3. 

We may therefore apply Corollary 3 to the inner products 

{fluij, i = l,2,...,k, f = 2,3 

to conclude that, for any g G E, there is at least one i < k for which 

|(/i,u^.g)| < < £ and |(/f,u^.g)| < e. 

For this choice of i one has hig € (72 H C^, as required. □ 

Proof (Theorem D). Let / := 1^;. As for Theorem C, this proof will be based 
on three different decompositions of / as given by Proposition 2. However, a new 
complication here is that the partition Qi will need to be chosen after Q 2 and Q 3 , 
and considerably finer than those others. 

In the course of the proof, we will meet three points at which we require a lower 
bound on D. All of these lower bounds will be satisfied for some D as in (7), so 
there is a choice of D of the form in (7) for which the whole proof can be carried 
out. 

Step 1. Set k := [36/e^] and 77 := l/(3fc)®. Let Q 2 and Q 3 be partitions as 
given by the two variants of Proposition 2 for this error tolerance 77 , and consider 
the resulting decompositions 

f = f^ + E2/ = f(t + E3/. 


Let 

and 


C 2 := {5 e G : \ J f- ■ fS^TS dM| < e/3} 

C3 := {5 e G : \ J f- (E 2 /) 5 S • dM| < e/3} 


Given the above choice of k and 77 , there is a D as in (7) which satisfies D > 
Ek^/rf (indeed, at this point (7) leaves vastly more room than we need). Therefore 
Proposition 5 applies to give hi,... ,hk G G and some auxiliary [—1, l]-valued 
functions 


^ and for 1 < i < j < fc 


Po o ji rr( ^ Pr 

-1 


such that E C {h^ .,h^ } • (G 2 fl G 3 ), where 

E= fl (A(T, El,^^ , , 77 ) n A(T, , 77 )). 
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Let Mi ■= \ Qi\iox i = 2,?>. 

Step 2. Now set rf := let Qi be as given by Proposition 2 for this 

error tolerance 77 ', and consider the decomposition 

f = f^ + El/. 


Let 

Cl := {5 e G : I y /i^ • (E2/)5® • dp| < e/s}. 

Observe that 

V' > (e/6) exp ( - 4 ( 1 / 77 )°(i)) > exp ( — 
for e £ (0,1/2). Therefore 


16|gip|g2p|g3p ^ 16exp(4(l/?7')°(^) +8(1/7?)°(^)) 


< exp (exp ((l/e)®^^^)) 


for all e £ (0,1/2), and so there are values of V satisfying (7) for which (18) 
holds for these partitions Qi, S 2 and Q 3 . (This is the only point at which we need 
the double exponential in (7).) Therefore Proposition 3 applies, and so it suffices 
to show that the set C from that proposition is 7f-syndetic. Moreover, that set C 
clearly contains Ci fl C 2 H C 3 , so it suffices to show that this triple intersection is 
iT-syndetic. 

Step 3. We now make use of the decompositions (16) and (17). Substituting 
these into the integral that appears inside the definition of Ci, we obtain 


m2,m3 


2,1,m 2 


(?/)^3,12,m3 {X ^y)){h2,2 ,m 2 ^3, 2 ,m 3 ){gx)dx dy. 


(25) 


Let 

ll^m2,m3{x,y) := f^{x,y)h2,l,m2{y)h3,12,m3{x~^y), 

let 

^7712,^3(7^5^) ■— (^2,2,7772 ^3,2,7773 ) (7^) 

(so (p depends only nominally on y), and let 

E := Pj A(^S,1prn2,m3j Pm2,m2,y ^ ■ 

7772,7773 

We will now show that E' <£ Ci, so suppose that g £ E'. Then the definition of 
E' and the expression (25) give 
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Jf^ ■ {E2f)SS ■ {E,f)S<>TO dfi\ < E I / 

m 2 ,m 3 

— ^ ^ J '4’m2,m3 djj, j (^7712,m 3 d^ 


+ v 


< 


E I j + e/6. 


Substituting from the definition of V'm 2 ,m 3 , this is 


E I // /i^(^^y)^2,i ,m2 (y)^3,i2,m3y) ti^dy +e/6 

m 2 ,m 3 

< MaMallA^II^^ + e/6 < M2M3A + e/6 = e/ 3 , 

so g G Cl- 

Step 4. Finally, letting 

k k 

E" = f]h-^E' = f] f| A{S,^„,,ipm2,m3S^\g'), 

i—\ m 2 ,m 3 


2=1 


it follows that 


f; n f;" c {h^\ ..., h-^} ■ (Ci n C2 n C3) c {h^\..., h-^} ■ c. 


On the other hand. 


EDE" 




l<2<J''<fe 


k 

nn n ^( 5 , V^m2,m3 ; ^m2,m3^ % ^ ) ■ 

2=1 m 2 ,m 3 


Since e < 1/2, this is an intersection of at most 

2k^ + kM2M3 < 2k^ + /cexp(4(l/77)®^^^) <exp((l/e)0«) 
anti-neighbourhoods for e € (0,1/2). On the other hand, 

77 , 77 ' > exp(-(l/e)°(^)). 


Let 


Kq := [2(2fc^-I-fcM2M3)/min{77,?7'}^-f 1 ] < exp ((l/e)°^^^). 
There is a O satisfying (7) for which 
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D > K^{2P + kM2M3) + 1. 

Therefore Corollary 4 can be applied to deduce that this intersection is still Kq- 
syndetic. Therefore C is TT-syndetic for 

K = Kok = exp ((l/£:)°(^)), 


as required. □ 

Remark 3. The above proof uses three different decompositions of /, one for each 
the three positions in the triple form (11). However, these decompositions do not 
have equal status: the decomposition / = //* + Ei/ (corresponding to the first po¬ 
sition) requires a much finer partition Qi, and depends on having already obtained 
the partitions Q 2 and Q 3 corresponding to the second and third positions. This con¬ 
trasts with the proof of Theorem C, where the three positions in the triple form (11) 
have roughly equal status. 

The reason for this difference can be seen in the proof of Proposition 4. First, 
we proved the bound that uses the norm ||/ 2||(|)^2 or ll/allifij^ 3 ; then, the bound 
that uses ll/iH^j obtained by the same argument upon changing variables 

to g' := g~^. However, this change of variables converts left-syndeticity to right- 
syndeticity, so we cannot use it in the same way to analyze the set Ci in the proof 
of Theorem D. Instead, we must first replace / with its structured approximations 
E 2 / (in the second position) and E 3 / (in the third position), and then use the special 
structure of those approximations to analyze the contribution of / in the first position 
of the triple form, without changing variables. 

This discussion also suggests why our methods fail to answer Question 1 from 
the Introduction (about syndeticity in the setting of Theorem B). The current version 
of Theorem B involves an estimate of the triple form for naive corners which uses 
an integral of the kind appearing in Corollary 1. That integral features both T® 
and . Once again, the appearance of converts left-syndetic sets into right- 
syndetic sets, and so it is not clear how to obtain control on this integral on any 
particular anti-neighbourhood. <1 
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